Safety-critical Policy Iteration Algorithm for Control under Model Uncertainty

نویسندگان

چکیده

Safety is an important aim in designing safe-critical systems. To design such systems, many policy iterative algorithms are introduced to find safe optimal controllers. Due the fact that most practical finding accurate information from system rather impossible, a new online training method presented this paper perform reinforcement learning based algorithm using real data instead of identifying dynamics. Also, impact model uncertainty examined on control Lyapunov functions (CLF) and barrier (CBF) dynamic limitations. The Sum Square program used iteratively solution. simulation results which applied quarter car show efficiency proposed fields optimality robustness.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning control under uncertainty: A probabilistic Value-Iteration approach

In this paper, we introduce a probabilistic version of the wellstudied Value-Iteration approach, i.e. Probabilistic Value-Iteration (PVI). The PVI approach can handle continuous states and actions in an episodic Reinforcement Learning (RL) setting, while using Gaussian Processes to model the state uncertainties. We further show, how the approach can be efficiently realized making it suitable fo...

متن کامل

Policy Iteration Algorithm for Shortest Path Problems

Abstract. The shortest paths tree problem consists in finding a spanning tree rooted at a given node, in a directed weighted graph, such that for each node i , the path of the tree which goes from i to the root has minimal weight. We propose an algorithm which is a deterministic version of Howard’s policy iteration scheme. We show that policy iteration is faster than the Bellman (or value itera...

متن کامل

Approximate Policy Iteration for Markov Control Revisited

Q-Learning is based on value iteration and remains the most popular choice for solving Markov Decision Problems (MDPs) via reinforcement learning (RL), where the goal is to bypass the transition probabilities of the MDP. Approximate policy iteration (API) is another RL technique, not as widely used as Q-Learning, based on modified policy iteration. In this paper, we present and analyze an API a...

متن کامل

Policy iteration based feedback control

It is well known that stochastic control systems can be viewed as Markov decision processes (MDPs) with continuous state spaces. In this paper, we propose to apply the policy iteration approach in MDPs to the optimal control problem of stochastic systems. We first provide an optimality equation based on performance potentials and develop a policy iteration procedure. Then we apply policy iterat...

متن کامل

An Introduction to Hybrid Model Inventory Control with a Green Supplier Selection Model under Uncertainty

In the current decade, determining the most appropriate supplier as a strategic factor in the supply chain has attracted lots of consideration. On the other hand, organizations do necessary measures to implement green supply chain management in order to improve environmental and economic performance. An important way to implement green supply chain management, could be revising the method of pu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Artificial intelligence advances

سال: 2022

ISSN: ['2661-3220']

DOI: https://doi.org/10.30564/aia.v4i1.4361